HDDS-15193. Move the "atomic key creation" logic from output stream to S3 endpoints #10202

Open

peterxcli wants to merge 8 commits into apache:master from peterxcli:codex/HDDS-15193-remove-old-atomic-key-creation

Conversation

@peterxcli
Member

@peterxcli peterxcli commented May 6, 2026

What changes were proposed in this pull request?

This is a functionally no-op change.

The "atomic key creation" logic in the Ozone client output stream exists only so that s3g can perform a length check.

This patch removes the "atomic key creation" logic from the Ozone client output stream classes and shifts responsibility for validating S3 object upload content length from the client layer to the S3 gateway layer. The main effect is that S3-specific length checks are now enforced in the S3 gateway rather than in the general Ozone client code, giving a cleaner separation of concerns and more maintainable code.

The name "atomic key creation" is also misleading: it suggests the feature is about atomicity, but it is actually an integrity check on the written block data for s3g. Users who want real atomicity must use conditional requests.

Related to: #5524
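As a rough illustration of the gateway-side check being moved here (a minimal sketch with illustrative names; `ContentLengthCheck` and `IncompleteBodyException` are stand-ins, not the exact Ozone API), the S3 gateway compares the bytes actually written against the declared Content-Length before committing the key:

```java
// Hypothetical sketch: the S3 gateway validates that the number of bytes
// actually written matches the declared Content-Length header, and refuses
// to commit the key otherwise. Names are illustrative, not the Ozone API.
public final class ContentLengthCheck {

  /** Thrown when the streamed body is shorter or longer than declared. */
  public static class IncompleteBodyException extends RuntimeException {
    public IncompleteBodyException(long expected, long actual) {
      super("expected " + expected + " bytes but wrote " + actual);
    }
  }

  /** Gateway-side check: only commit when the full declared body arrived. */
  public static void validateContentLength(long declared, long written) {
    if (declared != written) {
      throw new IncompleteBodyException(declared, written);
    }
  }

  public static void main(String[] args) {
    validateContentLength(1024, 1024); // full body: commit proceeds
    try {
      validateContentLength(1024, 512); // truncated upload: commit rejected
      throw new AssertionError("should have thrown");
    } catch (IncompleteBodyException expected) {
      // the incomplete key is never committed
    }
  }
}
```

Note this is an integrity check, not atomicity: it only guards against committing a key whose body does not match the declared length.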

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-15193

How was this patch tested?

(Please explain how this patch was tested. Ex: unit tests, manual tests, workflow run on the fork git repo.)
(If this patch involves UI changes, please attach a screenshot; otherwise, remove this.)

@peterxcli peterxcli changed the title HDDS-15193. Remove old atomic key creation HDDS-15193. Remove old S3 atomic key creation May 6, 2026
@peterxcli peterxcli marked this pull request as ready for review May 6, 2026 18:28
@peterxcli peterxcli requested a review from ChenSammi May 8, 2026 06:27
@peterxcli
Member Author

cc @xichen01, @ivandika3 Could you please take a look? Thanks!

@chungen0126
Contributor

Thanks @peterxcli for the patch. However, I went back to review the PR for these lines, and it appears they are not related to conditional requests. They were actually added to fix a commit overwrite issue that occurs when multiple S3G instances are writing the same key.

I am concerned about removing them, as I don't think that specific problem has been resolved yet. Could you please provide a bit more detail on why these lines need to be deleted? Thanks!

Signed-off-by: peterxcli <peterxcli@gmail.com>
@peterxcli peterxcli changed the title HDDS-15193. Remove old S3 atomic key creation HDDS-15193. refactor old S3 atomic key creation May 8, 2026
@peterxcli peterxcli changed the title HDDS-15193. refactor old S3 atomic key creation HDDS-15193. Keep length validation logic only in s3g module May 8, 2026
@peterxcli peterxcli changed the title HDDS-15193. Keep length validation logic only in s3g module HDDS-15193. Movesthe "atomic key creation" logic from client output to S3 Endpoint May 8, 2026
@peterxcli peterxcli changed the title HDDS-15193. Movesthe "atomic key creation" logic from client output to S3 Endpoint HDDS-15193. Movesthe "atomic key creation" logic from output stream to S3 Endpoint May 8, 2026
@peterxcli peterxcli changed the title HDDS-15193. Movesthe "atomic key creation" logic from output stream to S3 Endpoint HDDS-15193. Move the "atomic key creation" logic from output stream to S3 endpoints May 8, 2026
Contributor

@ivandika3 ivandika3 left a comment


Thanks @peterxcli , overall LGTM. Using the new pre-commit hook is a good idea. Left a few nits.

peterxcli added 3 commits May 9, 2026 17:44
…date ClientProtocolStub accordingly.

Signed-off-by: peterxcli <peterxcli@gmail.com>
Signed-off-by: peterxcli <peterxcli@gmail.com>
Signed-off-by: peterxcli <peterxcli@gmail.com>
@peterxcli
Member Author

I cherry-picked the refactor/fix for the test failure to #10224.

The error log is:

2026-05-09 07:21:08,033 [qtp1907241392-109] WARN server.HttpChannel: /bucket-qiyxgiosny/ozone-test-5105985444/ecmultipartKey32
javax.servlet.ServletException: javax.servlet.ServletException: java.lang.NullPointerException: Cannot invoke "org.apache.hadoop.ozone.client.io.KeyDataStreamOutput.setPreCommits(java.util.List)" because the return value of "org.apache.hadoop.ozone.client.io.OzoneDataStreamOutput.getKeyDataStreamOutput()" is null

The root cause of the EC MPU failure is that RpcClient#createMultipartStreamKey detects EC and falls back to an OzoneOutputStream backed by KeyOutputStream, so getKeyDataStreamOutput() returned null and triggered the NPE.
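The guard needed for that NPE can be sketched as follows (simplified types; `EcFallbackGuard` and its `register` helper are illustrative, not the real Ozone client classes): when the EC path falls back to a KeyOutputStream, getKeyDataStreamOutput() returns null and must be checked before calling setPreCommits.

```java
import java.util.List;

// Illustrative guard for the NPE described above: simplified stand-ins for
// the real Ozone client classes, showing the null check on the EC fallback.
public final class EcFallbackGuard {

  interface KeyDataStreamOutput {
    void setPreCommits(List<Runnable> hooks);
  }

  static final class OzoneDataStreamOutput {
    private final KeyDataStreamOutput inner; // null on the EC fallback path
    OzoneDataStreamOutput(KeyDataStreamOutput inner) { this.inner = inner; }
    KeyDataStreamOutput getKeyDataStreamOutput() { return inner; }
  }

  /** Registers hooks only when a data-stream output actually backs the key. */
  static boolean register(OzoneDataStreamOutput out, List<Runnable> hooks) {
    KeyDataStreamOutput k = out.getKeyDataStreamOutput();
    if (k == null) {
      return false; // EC fell back to KeyOutputStream; nothing to set
    }
    k.setPreCommits(hooks);
    return true;
  }

  public static void main(String[] args) {
    if (register(new OzoneDataStreamOutput(null), List.of())) {
      throw new AssertionError("EC fallback must not register hooks");
    }
    if (!register(new OzoneDataStreamOutput(hooks -> { }), List.of())) {
      throw new AssertionError("streaming path should register hooks");
    }
  }
}
```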

@chungen0126
Contributor

chungen0126 commented May 9, 2026

Thanks @peterxcli for working on this. However, I have some concerns regarding the feasibility of placing this atomic validation within the pre-commit hook. Here are my thoughts:

I believe the original issue stems from scenarios where a timeout occurs due to network instability or other unknown factors (on the client side or the datanode side), leading to a wrong commit.

Consider this flow: If an exception is thrown during the IO copy process because of timeout, it will jump to the catch block. At this point, the content length validation has not yet been added into the pre-commit hook, meaning the incorrect commit would still proceed.

The original atomic validation was implemented directly within the close() method, ensuring it executed no matter when an exception occurred. Now it has been moved to a pre-commit hook that is registered later. If an exception interrupts the process before this hook is registered, the validation will be skipped, and an incomplete key might still be committed.

You can try testing one of the specific scenarios provided by @xichen01 for this issue.

Another similar problem:
Ozone can generate an incomplete key.

Reproduce Step:
Ozone S3
Upload the key and interrupt (ctrl + c) the upload before it completes.

An incomplete key will be committed:

[root@VM-8-3-centos ~]$ aws configure set default.s3.multipart_threshold 1GB
[root@VM-8-3-centos ~]$ aws s3 --endpoint-url http://localhost:9878 cp ~/500M.img s3://bucket1/500M.img

^Ccancelled: ctrl-c received
[root@VM-8-3-centos /tmp]$ aws s3 ls s3://bucket1/ --endpoint-url http://localhost:9878
2023-10-24 15:48:13 121208832 500M.img
AWS S3
Upload the key and interrupt (ctrl + c) the upload before it completes.

The incomplete key will not be committed:

[root@VM-8-3-centos ~]$ aws configure set default.s3.multipart_threshold 1GB
[root@VM-8-3-centos ~]$ aws s3 cp ~/500M.img s3://bucket1/500M.img
^Ccancelled: ctrl-c received
[root@VM-8-3-centos ~]$ aws s3 ls s3://bucket1/500M.img
// nothing output
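The ordering concern described above can be sketched in simplified form (not the real Ozone classes; `Output`, `upload`, and the `copyFails` flag are illustrative): if the body copy throws before the length check is registered as a pre-commit hook, the commit path runs with no validation at all.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch of the hook-ordering hazard: registering the length
// validation AFTER the IO copy means a copy failure leaves preCommits empty,
// so close() commits the (partial) key unvalidated.
public final class PreCommitOrdering {

  static final class Output {
    final List<Runnable> preCommits = new ArrayList<>();

    void close() {                       // commit path
      preCommits.forEach(Runnable::run); // runs whatever hooks exist
    }
  }

  /** Simulates an endpoint that copies the body, THEN registers the check. */
  static boolean upload(Output out, boolean copyFails) {
    try {
      if (copyFails) {
        throw new RuntimeException("timeout during IO copy");
      }
      out.preCommits.add(() -> { /* validateContentLength(...) */ });
    } catch (RuntimeException e) {
      // fall through to close(), as the endpoint's cleanup path would
    }
    out.close();
    return out.preCommits.isEmpty(); // true => commit ran with no validation
  }

  public static void main(String[] args) {
    if (!upload(new Output(), true)) {
      throw new AssertionError("failed copy should skip the hook entirely");
    }
    if (upload(new Output(), false)) {
      throw new AssertionError("successful copy should register the hook");
    }
  }
}
```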

@chungen0126
Contributor

cc @xichen01 as the author of the original implementation. Also cc @ChenSammi @kerneltime, as you both reviewed the previous PR. I would love to hear your thoughts on this.

@ChenSammi
Contributor

ChenSammi commented May 12, 2026

@peterxcli , I think this move of the "atomic key creation" logic cannot replace #5524. The key problem is that writeToStreamOutput() can throw an exception before validateContentLength is added to preCommits. You can double-check the stack I shared in the comments under https://issues.apache.org/jira/browse/HDDS-9526.

@peterxcli
Member Author

peterxcli commented May 12, 2026

[image]

Thanks, you're right. I'll check whether the length pre-commit can be registered before writeToStreamOutput.
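That proposed fix can be sketched as follows (a minimal sketch with illustrative names; `RegisterBeforeWrite`, `Output`, and `upload` are stand-ins, not the exact Ozone API): register the length pre-commit before any bytes are copied, so the validation is in place no matter where the copy fails.

```java
import java.util.ArrayList;
import java.util.List;

// Hedged sketch of the proposed fix: the length pre-commit is registered
// BEFORE the body copy, so the commit path always validates, even when the
// copy stops partway through.
public final class RegisterBeforeWrite {

  static final class Output {
    final List<Runnable> preCommits = new ArrayList<>();
    long written;

    void commit() {
      preCommits.forEach(Runnable::run); // validation always runs here
    }
  }

  static void upload(Output out, long declaredLength, long bytesAvailable) {
    // Register the check first, before writing any bytes.
    out.preCommits.add(() -> {
      if (out.written != declaredLength) {
        throw new IllegalStateException("incomplete body: " + out.written);
      }
    });
    out.written = Math.min(bytesAvailable, declaredLength); // simulated copy
    out.commit();
  }

  public static void main(String[] args) {
    upload(new Output(), 100, 100); // complete upload commits cleanly
    try {
      upload(new Output(), 100, 40); // truncated upload is rejected at commit
      throw new AssertionError("should have thrown");
    } catch (IllegalStateException expected) {
      // the partial key never reaches a clean commit
    }
  }
}
```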
